Variable Impedance Control - A Reinforcement Learning Approach
Authors
Abstract
One of the hallmarks of the performance, versatility, and robustness of biological motor control is the ability to adapt the impedance of the overall biomechanical system to different task requirements and stochastic disturbances. A transfer of this principle to robotics is desirable, for instance to enable robots to work robustly and safely in everyday human environments. It is, however, not trivial to derive variable impedance controllers for practical high-DOF robotic tasks. In this contribution, we accomplish such gain scheduling with a reinforcement learning (RL) algorithm, PI² (Policy Improvement with Path Integrals). PI² is a model-free, sampling-based learning method derived from first principles of optimal control. The PI² algorithm requires no tuning of algorithmic parameters besides the exploration noise. The designer can thus focus fully on cost function design to specify the task. From the viewpoint of robotics, a particularly useful property of PI² is that it can scale to problems of many DOFs, so that RL on real robotic systems becomes feasible. We sketch the PI² algorithm and its theoretical properties, and how it is applied to gain scheduling. We evaluate our approach by presenting results on two different simulated robotic systems, a 3-DOF Phantom Premium Robot and a 6-DOF Kuka Lightweight Robot. We investigate tasks where the optimal strategy requires tuning both the impedance of the end-effector and a reference trajectory. The results show that path-integral-based RL can be used not only for planning but also to derive variable gain feedback controllers in realistic scenarios. Thus, the power of variable impedance control is made available to a wide variety of robotic systems and practical applications.
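As a rough illustration of the idea of learning a gain schedule from sampled rollouts, the following sketch applies a cost-weighted averaging update to a piecewise-constant stiffness schedule on a toy 1-DOF unit mass. This is a simplified, episode-level stand-in in the spirit of path-integral policy improvement, not the full PI² algorithm described in the paper. The plant, the cost function, the critical-damping rule, and all names (`simulate`, `cost_weighted_update`, `expand_schedule`, the sensitivity `h`) are assumptions made for this example.

```python
import numpy as np


def simulate(gains, disturbance, dt=0.01, target=1.0):
    """Roll out a 1-DOF unit mass under a PD law with time-varying stiffness.

    Returns the accumulated cost: squared tracking error plus a small
    penalty on stiffness (both terms are assumptions of this sketch).
    """
    q, qd, cost = 0.0, 0.0, 0.0
    for k, f in zip(gains, disturbance):
        k = max(float(k), 0.0)           # stiffness must stay non-negative
        d = 2.0 * np.sqrt(k)             # critical damping for a unit mass
        tau = k * (target - q) - d * qd  # variable impedance (PD) control law
        qd += (tau + f) * dt             # unit mass plus external force
        q += qd * dt
        cost += ((target - q) ** 2 + 1e-4 * k) * dt
    return cost


def cost_weighted_update(theta, costs, eps, h=10.0):
    """Average the sampled perturbations, weighting low-cost rollouts more.

    Costs are normalized to [0, 1] and mapped through exp(-h * cost),
    so the only quantity left to tune is the exploration noise in eps.
    """
    costs = np.asarray(costs, dtype=float)
    c = (costs - costs.min()) / (costs.max() - costs.min() + 1e-12)
    w = np.exp(-h * c)
    w /= w.sum()
    return theta + w @ eps


def expand_schedule(theta, steps_per_segment):
    """Turn per-segment stiffness parameters into a per-time-step schedule."""
    return np.repeat(theta, steps_per_segment)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    n_segments, steps_per_segment = 4, 50
    disturbance = np.zeros(n_segments * steps_per_segment)
    disturbance[100:120] = -20.0         # a push during the third segment

    theta = np.full(n_segments, 10.0)    # initial constant stiffness
    eps = 3.0 * rng.standard_normal((10, n_segments))  # exploration noise
    costs = [simulate(expand_schedule(theta + e, steps_per_segment), disturbance)
             for e in eps]
    theta_new = cost_weighted_update(theta, costs, eps)

    print("cost before:", simulate(expand_schedule(theta, steps_per_segment), disturbance))
    print("cost after: ", simulate(expand_schedule(theta_new, steps_per_segment), disturbance))
```

Because the update is a weighted average of random perturbations, a single step is not guaranteed to lower the cost; in practice such updates are repeated over many batches of rollouts.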
Similar resources
Learning Variable Impedance Control
One of the hallmarks of the performance, versatility, and robustness of biological motor control is the ability to adapt the impedance of the overall biomechanical system to different task requirements and stochastic disturbances. A transfer of this principle to robotics is desirable, for instance to enable robots to work robustly and safely in everyday human environments. It is, however, not t...
Learning variable impedance control
One of the hallmarks of the performance, versatility, and robustness of biological motor control is the ability to adapt the impedance of the overall biomechanical system to different task requirements and stochastic disturbances. A transfer of this principle to robotics is desirable, for instance to enable robots to work robustly and safely in everyday human environments. It is, however, not t...
Novel Framework of Robot Force Control Using Reinforcement Learning
Over the past decades, robotic technologies have advanced remarkably and have been proven to be successful, especially in the field of manufacturing. In manufacturing, conventional position-controlled robots perform simple repeated tasks in static environments. In recent years, there are increasing needs for robot systems in many areas that involve physical contacts with human-populated environ...
Hierarchical Functional Concepts for Knowledge Transfer among Reinforcement Learning Agents
This article introduces the notions of functional space and concept as a way of knowledge representation and abstraction for Reinforcement Learning agents. These definitions are used as a tool of knowledge transfer among agents. The agents are assumed to be heterogeneous; they have different state spaces but share the same dynamics, reward, and action space. In other words, the agents are assumed t...
Variable Impedance Control for Rehabilitation Robot using Interval Type-2 Fuzzy Logic
In this study, a novel variable impedance control for a lower-limb rehabilitation robotic system using a voltage control strategy is presented. The majority of existing control approaches are based on a torque control strategy, which requires knowledge of the robot dynamics as well as the dynamics of the patient. This requires the controller to overcome complex problems such as uncertainties and nonlinearit...
Journal title:
Volume, issue:
Pages: -
Publication date: 2010